Managing speech databases with emur and the EMU-webapp
نویسنده
چکیده
As is the nature of the discipline, a majority of speech and language researchers spend a large amount of their time acquiring and transforming data into analyzable and interpretable forms to gain a better understanding of a certain subject matter. In this paper we present a collection of tools that aid the researcher in this sometimes tedious and error-prone process. The tools presented here are part of the next iteration of the EMU speech database management system which aims to be as close to an all-in-one solution for generating, manipulating, querying, analyzing and managing speech databases as possible.
منابع مشابه
EMU: an Enhanced Hierarchical Speech Data Management System
EMU is a system for labelling, managing and retrieving data from speech databases such as the Australian ANDOSL database or the US TIMIT. EMU is a re-implementation of the earlier MU+ system (Harrington, Cassidy, Fletcher, and McVeigh 1993) with the aim of providing a more flexible environment. The hierarchical structures and database query facility have been generalised and the system has been...
متن کاملCritical roles of the immunoglobulin intronic enhancers in maintaining the sequential rearrangement of IgH and Igk loci
V(D)J recombination of immunoglobulin (Ig) heavy (IgH) and light chain genes occurs sequentially in the pro- and pre-B cells. To identify cis-elements that dictate this order of rearrangement, we replaced the endogenous matrix attachment region/Igk intronic enhancer (MiE(kappa)) with its heavy chain counterpart (Emu) in mice. This replacement, denoted EmuR, substantially increases the accessibi...
متن کاملEMU-SDMS: Advanced speech database management and analysis in R
The amount and complexity of the often very specialized tools necessary for working with spoken language databases has continually evolved and grown over the years. The speech and spoken language research community is expected to be well versed in multiple software tools and have the ability to switch seamlessly between the various tools, sometimes even having to script adhoc solutions to solve...
متن کاملCompiling multi-tiered speech databases into the relational model: experiments with the emu system
The Emu speech database system enables the annotation of speech signals at many levels of detail and provides a mechanism for making links between these levels to produce a hierarchical annotation. Emu provides facilities for searching collections of these annotations according to both sequential and hierarchical criteria. The results of a search can be used to retrieve acoustic and other data ...
متن کاملمراحل و نحوه ی تهیه ی دادگان های صوتی هجایی و دایفونی برای سامانه ی تبدیل متن به گفتار فارسی
Abstract Speech databases are part of the concatenative text to speech synthesis systems. Phonetic quality of the databases plays a significant role in the naturalness of the synthesized speech. This paper introduces two syllable and diphone speech databases for Persian and investigates the way of their development and their specifications and their advantages to each other. ...
متن کامل